Words containing a comma (,):

Rank in Wordlist | Frequency | Word |
---|---|---|
2043 | 1656 | 4,156 |
4533 | 674 | 1,000 |
5662 | 516 | 2,000 |
7543 | 358 | 3,000 |
8064 | 329 | 100,000 |
8426 | 310 | 1,700 |
8868 | 289 | 1,500 |
9101 | 279 | 10,000 |
9364 | 269 | 5,000 |
9430 | 267 | 20,000 |

Words containing ( or ):

Rank in Wordlist | Frequency | Word |
---|---|---|
105543 | 5 | .) |

Words containing %:

Rank in Wordlist | Frequency | Word |
---|---|---|
2217 | 1530 | 100% |
3683 | 864 | 50% |
4529 | 675 | 10% |
4570 | 669 | 20% |
5325 | 557 | 80% |
5353 | 551 | 30% |
5374 | 549 | 40% |
5533 | 530 | 90% |
5771 | 505 | 70% |
6302 | 453 | 60% |

Words containing &:

Rank in Wordlist | Frequency | Word |
---|---|---|
1557 | 2150 | Atención |
1558 | 2150 | principalTítulos |
24471 | 67 | G&T |
26474 | 59 | S&P |
28009 | 54 | T&C |
31857 | 44 | más |
33419 | 40 | AT&T |
47009 | 23 | país |
53912 | 18 | día |
55023 | 17 | H&Y |

Words containing $:

Rank in Wordlist | Frequency | Word |
---|---|---|
21907 | 80 | US$ 1 |
35162 | 37 | US$50 |
35745 | 36 | US$1 |
37671 | 33 | US$ 10 |
39882 | 30 | US$100 |
41508 | 28 | US$3 |
43308 | 26 | US$10 |
43309 | 26 | US$20 |
44314 | 25 | US$100.00 |
44315 | 25 | US$30 |

Words containing ":

Rank in Wordlist | Frequency | Word |
---|---|---|
2957 | 1096 | ." |

Words containing ':

Rank in Wordlist | Frequency | Word |
---|---|---|
13472 | 165 | .' |
34507 | 38 | John's |
34555 | 38 | Papa John's |
47236 | 23 | world's |
55174 | 17 | Moody's |
55327 | 17 | Shaw's |
59225 | 15 | ONG's |
62268 | 14 | d'urgell |
75657 | 10 | d'Aubuisson |
75658 | 10 | d'en |

Words containing +:

Rank in Wordlist | Frequency | Word |
---|---|---|
12035 | 194 | Ángel+Sermeño+Quezada |
12394 | 185 | Carlos+Ernesto+Grande |
13477 | 165 | Departamento+de+Filosofía |
13593 | 163 | Carlos+Ayala+Ramírez |
13652 | 162 | José+María+Tojeira |
14323 | 151 | Gabriel+Escolán+Romero |
14581 | 147 | Luis+Eduardo+Aguilar |
14636 | 146 | Dirección+de+Comunicaciones |
15165 | 139 | Jorge+Alberto+Rodríguez |
30744 | 46 | 01+00:00 |

Words containing *:

Rank in Wordlist | Frequency | Word |
---|---|---|
152555 | 3 | Sgr A* |
199300 | 2 | Sagitario A* |
349948 | 1 | Sagittarius A* |

Words containing /:

Rank in Wordlist | Frequency | Word |
---|---|---|
1464 | 2278 | y/o |
4750 | 639 | https://www |
4752 | 639 | km/h |
6241 | 459 | https://bit |
6687 | 420 | 24/7 |
10962 | 219 | Oficinas/locales |
12380 | 186 | https://t |
15233 | 138 | 1/2 |
15694 | 132 | ISO/IEC |
18850 | 101 | OPS/OMS |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words, while others are not. If words containing forbidden characters do not have a very low frequency, there may be a problem in preprocessing.
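The check described above can be sketched in a few lines of Python. This is only an illustration: the set of forbidden characters and the frequency threshold (`FORBIDDEN`, `MAX_OK_FREQ`) are hypothetical values that would have to be chosen per language and corpus, and the function name `suspicious_words` is not part of any existing tool.

```python
# Hypothetical settings: which special characters count as forbidden and what
# counts as a "very low" frequency depend on the language and the corpus.
FORBIDDEN = set(',()%&$"\'+*=/_')
MAX_OK_FREQ = 10


def suspicious_words(rows, forbidden=FORBIDDEN, max_ok_freq=MAX_OK_FREQ):
    """Return (word, freq) pairs that contain a forbidden character
    and occur more often than max_ok_freq."""
    return [
        (word, freq)
        for word, freq in rows
        if freq > max_ok_freq and any(ch in forbidden for ch in word)
    ]


# A few (word, frequency) pairs copied from the tables in this section.
rows = [("100%", 1530), ("AT&T", 40), (".)", 5), ("Sgr A*", 3), ("y/o", 2278)]

for word, freq in suspicious_words(rows):
    print(f"{freq:>6}  {word}")
```

With these illustrative settings, `100%`, `AT&T` and `y/o` are reported, while `.)` and `Sgr A*` fall below the threshold. Whether a reported word is a real preprocessing problem still has to be judged by hand, since a character such as % or & may be perfectly acceptable in the language at hand.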
Example query for words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
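The same query can be produced for each of the characters listed above. The following Python sketch generates the query strings, assuming MySQL's default behaviour in which `%` and `_` are LIKE wildcards and the backslash is the escape character. The helper names `like_pattern` and `build_query` are hypothetical, and single-quoted string literals are used here (instead of the double quotes in the query above) so that only the apostrophe needs doubling.

```python
# Characters examined in this subsection (taken from the text above).
SPECIAL_CHARS = [",", "(", ")", "%", "&", "$", '"', "'", "+", "*", "=", "/", "_"]


def like_pattern(ch):
    """Return a LIKE pattern that matches words containing ch.

    % and _ are LIKE wildcards, so they are prefixed with a backslash
    (assumed to be the escape character, as in MySQL's default setup).
    """
    if ch in ("%", "_"):
        ch = "\\" + ch
    return "%" + ch + "%"


def build_query(ch, limit=10):
    """Build the query for one character, mirroring the query shown above."""
    # Double any apostrophe so the pattern is a valid single-quoted literal.
    pattern = like_pattern(ch).replace("'", "''")
    return (
        "select w_id-100, freq, word from words "
        f"where w_id>100 and word like '{pattern}' limit {limit};"
    )


for ch in SPECIAL_CHARS:
    print(build_query(ch))
```

Note that the underscore needs the same escaping as the percent sign. An unescaped `'%_%'` would match every non-empty word rather than only words containing an underscore.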
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots